Korpus: nds_wikipedia_2016

Weitere Korpora

3.6.2 Zipf's law for words of fixed lengths

Zipf distribution of words of fixed length 4, 6, 8, ..., 14.


Zipf's diagram for words of fixed length


Gnuplot diagram

Top Words of length 4
word rank frequency word
1 48596 hett
2 30923 weer
3 23611 sien
4 20481 sünd
5 15996 nich
Top Words of length 6
word rank frequency word
1 6246 Zensus
2 6119 annere
3 5218 Johren
4 5155 wedder
5 4631 Rebeet
Top Words of length 8
word rank frequency word
1 4984 Minschen
2 3795 Historie
3 3193 Engelsch
4 1920 Dezember
5 1817 twüschen
Top Words of length 10
word rank frequency word
1 3213 Demografie
2 1089 Süüdoosten
3 1002 Süüdwesten
4 835 verscheden
5 782 övernahmen
Top Words of length 12
word rank frequency word
1 1071 Inwahnertall
2 1070 Neddersassen
3 515 Nedderlannen
4 511 Produkschoon
5 439 Egenschoppen
Top Words of length 14
word rank frequency word
1 504 plattdüütschen
2 488 wohrschienlich
3 162 Unafhängigkeit
4 153 Tour de France
5 150 Sülvermedaille
Slope for length 4
Slope
-1.0987679001308097
Slope for length 6
Slope
-0.9097719677709344
Slope for length 8
Slope
-0.8170496196258504
Slope for length 10
Slope
-0.7555701710045468
Slope for length 12
Slope
-0.7996685664962445
Slope for length 14
Slope
-0.7403626894942437
1135 msec needed at 2018-01-07 11:17